Performance Comparison of Imputation Methods for Mixed Data Missing at Random with Small and Large Sample Data Set with Different Variability

نویسندگان

چکیده

One of the concerns in field statistics is presence missing data, which leads to bias parameter estimation and inaccurate results. However, multiple imputation procedure a remedy for handling data. This study looked at best methods used handle mixed variable datasets with different sample sizes variability along levels missingness. The employed predictive mean matching, classification regression trees, random forest methods. For each dataset, estimates complete were compared found imputed dataset. results showed that method was mostly 500 irrespective variability. tree worked on 30

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance evaluation of different estimation methods for missing rainfall data

There are numerous methods to estimate missing values of which some are used depending on the data type and regional climatic characteristics. In this research, part of the monthly precipitation data in Sarab synoptic station, east Azerbaijan province, Iran was randomly considered missing values. In order to study the effectiveness of various methods to estimate missing data, by seven classic s...

متن کامل

Missing data imputation in multivariable time series data

Multivariate time series data are found in a variety of fields such as bioinformatics, biology, genetics, astronomy, geography and finance. Many time series datasets contain missing data. Multivariate time series missing data imputation is a challenging topic and needs to be carefully considered before learning or predicting time series. Frequent researches have been done on the use of diffe...

متن کامل

Comparison of different methods for longitudinal data with missing observations

COMPARISON OF DIFFERENT METHODS FOR LONGITUDINAL DATA WITH MISSING OBSERVATIONS Lin Sun July 27, 2010 Longitudinal studies occupy an important role in scientific researches and clinical trials. When taking the analysis of longitudinal data, investigators are often confronted with missing data which will produce potential biases, even in well-controlled condition. In the literature, missing data...

متن کامل

Parametric fractional imputation for mixed models with nonignorable missing data

Inference in the presence of non-ignorable missing data is a widely encountered and difficult problem in statistics. Imputation is often used to facilitate parameter estimation, which allows one to use the complete sample estimators on the imputed data set. We develop a parametric fractional imputation (PFI) method proposed by Kim (2011), which simplifies the computation associated with the EM ...

متن کامل

Comparison of missing value imputation methods for crop yield data

Most ecological data sets contain missing values, a fact which can cause problems in the analysis and limit the utility of resulting inference. However, ecological data also tend to be spatially correlated, which can aid in estimating and imputing missing values. We compared four existing methods of estimating missing values: regression, kernel smoothing, universal kriging, and multiple imputat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Asian Journal of Probability and Statistics

سال: 2022

ISSN: ['2582-0230']

DOI: https://doi.org/10.9734/ajpas/2022/v20i2416